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Abstract 

A measurement of the top-quark pair-production cross section in pp collisions at y/s = 1.96 
TeV using data corresponding to an integrated luminosity of 1.12 fb^" 1 collected with the Collider 
Detector at Fermilab is presented. Decays of top-quark pairs into the final states ev + jets and 
fiu + jets are selected, and the cross section and the 6-jet identification efficiency are determined 
using a new measurement technique which requires that the measured cross sections with exactly 
one and multiple identified 6-quarks from the top-quark decays agree. Assuming a top-quark mass 
of 175 GeV/c 2 , a cross section of 8.5 ± 0.6(stat.) ± 0.7(syst.) pb is measured. 

PACS numbers: 14.65.Ha, 13.85.Ni, 13.85. Qk 
Keywords: top quark, cross section, b-tagging 



I. INTRODUCTION 

The properties of the top quark have been extensively studied since its discovery by the 
CDF and DO collaborations at the Fermilab Tevatron in 1995 |l|, |2j. The it production 
cross section was measured in all detectable decay channels, and good agreement was found 
between the results and the perturbative QCD calculations [3]. At the Fermilab Tevatron, 
a pp collider with a center-of-mass energy of 1.96 TeV, the dominant standard model (SM) 
mechanisms for it production are qq annihilation (85%) and gluon fusion (15%). For a top- 
quark mass of 175 GeV/c 2 the predicted total cross section is a t j = 6. 71^7 pb [4|]. Previous 
measurements have been limited by statistical uncertainties and by the uncertainty in the 
heavy- flavor jet identification efficiency at high energy [5j. 

The top quark decays into a W boson and a b quark almost 100% of the time [3[. The sig- 
nal significance is expected to be greatest in the lepton+jets channel, in which one W decays 
leptonically and the other W decays to quarks. These it events contain a high-momentum 
charged lepton, four jets from the four final-state quarks, and an undetected neutrino. To 



enhance the it purity, at least one jet in t 
mating from a bottom quark (6-tagged) 



re event is usually required to be identified as orig- 
5j. The 6-tagging efficiency, needed as an input to 



the measurement, introduces one of the largest sources of systematic uncertainty [5|. In this 
Letter we present a new technique to measure the it cross section (a t j) and determine the 
6-tagging efficiency in lepton+jets events, which reduces the systematic uncertainty of the 
measurement and allows a determination of the 6-tagging efficiency directly in the it sample. 
This is the first use of the it sample for in situ calibration of the 6-tagging efficiency. The 
improvements in the associated systematic uncertainties benefit directly other analyses, par- 
ticularly searches for the Higgs boson [6j and supersymmetric particles |7J or other scenarios 
of new physics [8j . 

II. SELECTION OF it CANDIDATE EVENTS 

Results reported here are obtained using 1.12 fb _1 of integrated luminosity collected 
between March 2002 and August 2006 by the Collider Detector at Fermilab (CDF II). CDF 
II [9] is a general-purpose particle detector located at one of the two interaction points 
of the Tevatron Collider. Charged-particle tracking is provided by an eight-layer silicon 



detector, surrounded by a 3.1 m long open-cell drift chamber, the central outer tracker 
(COT). Both are contained in a superconducting solenoid with a 1.4 T magnetic field. The 
silicon system provides three-dimensional hit information between radii of 1.4 cm and 28 cm, 
and allows to measure the distance of closest approach of energetic tracks to the event vertex 
in the transverse plane (impact parameter, do) with a resolution of ~ 40 /zm, including 
a 30 /iin contribution from the beamspot. The excellent impact parameter resolution is 
critical to identify displaced tracks which are associated to bottom quarks resulting from 



top-quark decays. The COT covers the pseudorapidity [10| range \t]\ < 1.1 and provides a 
long lever arm for track curvature measurements. Outside the solenoid, electromagnetic and 
hadronic calorimeters arranged in projective towers surround the tracking volume and absorb 
photons, electrons and hadrons with \r]\ < 3.6. Beyond the calorimeters, drift chambers track 
penetrating muons in the region \i]\ < 1.0. 

The data were collected with two high-p^ lepton triggers, one of which requires a high- 
Et electron (Et > 18 GeV) and the other a high-pr muon (py > 18 GeV/c). The trigger 
efficiency is 95.3 ± 1.5% (89.1 ± 1.6%) for identified electrons (muons). Events are selected 
off-line by requiring the presence of an isolated ll|] electron (muon) candidate with Et > 
20 GeV and \rj\ < 1.1 (pr > 20 GeV/c and \r)\ < 1.0), and at least three jets with E T > 20 
GeV and \t]\ < 2. Jets are clustered with a cone-based algorithm with a cone size AR = 
\/A(f) 2 + Ai]' 2 = 0.4, and their energies are corrected for instrumental effects and excess 



energy from additional pp collisions [12j. The primary vertex position along the beam is 
required to lie inside the luminous region (\z\ < 60 cm) and to be consistent with the z 
position of the point of origin of the high-pr lepton. Events with additional identified high- 
Pt leptons, or a track which forms a value compatible with the Z boson mass if combined 
with the primary lepton, are removed to suppress backgrounds. To account for the expected 



neutrino, we require large missing transverse energy 10], Ifi T > 20 GeV, which rejects ~ 50% 



/o 



of background events that do not contain a real W boson. Finally, as tt events typically 



have larger total transverse energy than background events, we require the H? [10| to exceed 
200 GeV. Table [I] includes the event count before 6-tagging (pretag) sorted by the number of 
jets in the event. In the one- and two-jet bins, where background dominates over the signal, 
the Ht requirement is not applied and the samples are used as control samples. 



III. IDENTIFICATION OF 6-JETS 

Requiring at least one jet to be 6-tagged considerably reduces the background. The b- 
tagging algorithm exploits the long lifetime of bottom hadrons by identifying decay vertices 



a. 



inside jets [5j. These vertices are reconstructed requiring a minimum of two or three tracks 
with an impact parameter significance (do/°"<fo) greater than 3.0 or 2.0, respectively. Track 
combinations consistent with a Kg or A are removed, and an upper limit of do = 0.15 cm 
is used to reject interactions with detector material. We measure the two-dimensional dis- 
placement of the secondary vertex from the primary interaction point projected along the jet 
axis (L,2d)- A jet is 6-tagged if the vertex has L 2 d significance (L 2 d/ctl 2D ) larger than 6.0, 
where the uncertainty on L 2 d includes contributions from both the primary and secondary 
vertex fits. The probability of misidentifying a light-flavor jet as a 6-quark jet due to detector 
resolution (mistag rate) is 1.9 ± 0.4%, estimated from secondary vertices reconstructed on 
the opposite side of the primary vertex with L 2 £> significance less than —6.0 in a generic jet 
sample [l3j. The mistag rate is corrected by a factor of 1.3±0.1 to account for the remaining 
contribution of long-lived light-flavor hadrons {K$ and A) and material interactions that are 
present only at positive L 2 d- 

The it acceptance calculation is based on the pythia [14j Monte Carlo simulation, with 
the CTEQ5L parton distribution functions 



Heavy-flavor decays are treated by EvtGen 



15fl and assuming a top mass of 175 GeV/c 2 . 
161 ]. Monte Carlo events are passed through a 



GEANT [17J simulation of the detector and subjected to the same selection requirements as 
the data. The total acceptance before ^-tagging is 4.3±0.1% (3.5±0.1%) for electron (muon) 
events and includes the branching fraction of the W boson, the geometric and kinematic 
acceptances, and the lepton identification and trigger efficiencies. 

In previous cross section measurements |5|] the tagging efficiency was needed as an input 
in order to perform the measurement. In those cases, the efficiency for the full it event 
was determined from simulation, and a multiplicative scale factor (Si) was applied to the 
efficiency found in the simulation to correct for the per-jet efficiency difference between 
data and simulation. This difference is caused by imperfections in the simulation, arising 
from, for example, incomplete description of the silicon detector, tracking efficiencies, and 
hadron decay modeling. The scale factor S& was measured using high statistics data and 
simulation samples, enriched in heavy flavor by requesting non-isolated low-px leptons, and 
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used to correct the heavy-flavor tagging efficiency in all the simulated samples. In the 
current 1.12 fb _1 data sample, Sb has been measured to be Sb = 0.95 ± 0.05, where the 
uncertainty is dominated by the extrapolation from the low-px calibration sample to typical 



tt jet energies 



13|. Using this nominal scale factor in a tt sample we expect 46 ± 4% of 
the events to have exactly one 6-tagged jet and 22 ± 3% to have two or more ^-tagged jets. 
Charm-quark jets which are 6-tagged are treated analogously to 6-quark jets, but we assume 
a 10% uncertainty on the scale factor for charm, S c . 

IV. BACKGROUND ESTIMATION 

The primary background process is direct W production with multiple jets, where the W 
boson decays leptonically. Smaller contributions come from QCD jet production in which 
the W signature is faked by jets appearing as electrons or by semi-leptonic 6-hadron decays 
(non-W), and different electroweak processes: single-top-quark production, diboson (WW, 
WZ, and ZZ) production, and Z boson decays to tau pairs. 

The W+jets contribution to the background is separated into events with and without 
heavy-flavor jets. To estimate the background contribution from W events with only light- 
flavor jets (iV tag + ), the mistag rate is parametrized as a function of jet Et, f], and number 
of tracks, as well as event total energy, number of primary vertices, and primary vertex z 
position. This parametrization is applied to the pretag dataset, and the result is scaled 
down to the fraction of the data not attributed to a physics process with heavy-flavor. 

To determine the number of tags that originated from W events accompanied by heavy- 
flavor quarks, the fractions of W+jets events attributable to Wbb, Wcc, and Wc are esti- 
mated with alpgen Monte Carlo 18j. Since in the simulation heavy-flavor production may 
arise both from the soft-radiation evolution of a given event or directly from the parton con- 
figuration, a procedure to avoid double counting has been implemented [13J. The fractions 
of W+jets with heavy flavor (heavy-flavor fractions, Fi, where the index corresponds to the 
different contributions) are calibrated using data and simulated samples of inclusive jets. 
The bottom and charm fractions in the simulation are extracted from the generator infor- 
mation, while the equivalents in the data are obtained from template fits to the kinematic 
and dynamic properties of the tagged events. The measured heavy-flavor fraction calibra- 
tion factor is consistent with unity, except in the one-jet bin, where the value for the data 
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is about 30% higher, the difference is assigned as an uncertainty [13]. The expected number 
of W events with heavy-flavor (iV" tag + ) is estimated by multiplying these fractions by the 
number of pretag events (N^ e ) and the tagging efficiency in these events (e,), measured 
from simulation and corrected by the scale factor, iV tag + = N^ e J2i£iE- The contribu- 
tion of other backgrounds and the ti signal are removed from the N:% e pretag expectation: 
N™ e = N^ a - N£™~ w - N* e - N^, where A^ ta is the number of pretag events, iV*'~ e is 
the number of pretag events attributed to ti production, N™™~ w is the number of events 
without a W boson, and N^ is the contribution from single-top-quark production, Z boson 
decays to tau pairs, and diboson production. Both N™°™~ w and N™^ are discussed below. 
Some events without a W boson may satisfy the pretag requirements of an identified 
lepton and large missing energy. For example, photon conversions or mis-identified pions or 
kaons may comply with the lepton requirements, and the missing energy may arise due to 
mis-measured jets. Heavy-flavor di-jet production may also result in leptons from the semi- 
leptonic decay faking the W boson signal. The expectation for this non-W background is 
determined from data, by fitting the missing energy distribution in each jet multiplicity bin. 
The templates used in the fit are derived from pythia (ti) and alpgen (W+jets) Monte 
Carlo, and samples enriched in fake leptons and heavy flavor. The sample with fake leptons 
is obtained from events in data where the primary lepton fails at least two identification 



requirements [19|, while di-jet events with non-isolated low- Et leptons provide the heavy- 
flavor enriched sample. The it contribution is fixed to the value expected for the measured 
cross section. The fraction of non-H^ events is measured from the fits both before and after 
requiring a 6-tagged jet. 

A Monte Carlo-based method is used to estimate the remaining backgrounds due to sin- 



gle top quark production (pythia/madevent 20j). vector boson pair production (pythia 



and Z — > tt (alpgen interfaced to pythia), normalizing the expectations to their respec- 



tive theoretical cross sections 2l|. The tagging efficiency is taken from these Monte Carlo 
samples and corrected by Sb- 

V. RESULTS 

The scale factor enters the equation in the signal term as well as in the jy+jets with 
heavy-flavor and electroweak backgrounds. Instead of using the Sb value measured in the 
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low-pr lepton sample, in this paper we perform a simultaneous fit of the it cross section and 
Sb by requiring that the single and multiple-tagged samples correspond to the same it cross 
section. Since the W+jets backgrounds and, to a lesser extent, the non-jy background 
depend on the assumed it cross section, the procedure to determine them is iterative. The 
it contribution to the 6-tagged sample is written as N* g = iV tag - N^+ HF - N^+ LF - N^. 
Figure [T] shows the cross section measurement for exclusive single 6-tagged events, and for 
events with two or more 6-tagged jets as a function of Sb- A Poisson likelihood fit yields 
results of a t t — 8.2 ± 0.9 pb with only the statistical and scale factor uncertainties included, 
and Sb = 0.98 ± 0.07, consistent with the Sb value measured in the low-p^ lepton sample. 
The associated confidence bands are also shown in Fig. [TJ The signal and backgrounds 
contributions to the 6-tagged data sample are summarized in Tables [J and [Til for the single 
and the multiple-tagged data respectively, and are shown in Fig. [2] as a function of the jet 
multiplicity. 

The systematic uncertainties on the cross section, before 6-tagging requirements, are 
dominated by a 4.2% uncertainty due to the jet energy calibration. Other sources of uncer- 
tainty are the choice of Monte Carlo generator (2.1%), the lepton identification efficiency 

(2.0%), the choice of parton distribution functions (0.7%), and the modeling of initial and 

I 
final state radiation (0.5%). The uncertainty in the integrated luminosity is 6% [22]. A 

3.3% uncertainty associated to the background estimation was determined using a large 
ensemble of simulated experiments by fluctuating the backgrounds within their uncertain- 
ties. The different sources of systematic uncertainty are added in quadrature for the signal 
expectation [5]. 

In order to include the independent measurement of the Sb in the lepton sample (Sb = 
0.95 ± 0.05), a term to penalize deviations from this value is added to the likelihood. With 
this constraint we measure a cross section of 8.5±0.6(stat.) ±0.7(syst.) pb and a scale factor 
of 0.96 ± 0.04. With this Sb the fraction of selected it candidates with exactly one identified 
6-quark is 46 ± 4% and with two or more 6-quarks is 23 ± 3%. The statistical uncertainty of 
a t i also includes the uncertainty on Sb- 

In conclusion, we have performed the first simultaneous fit of the it production cross 
section and the 6-tagging efficiency in pp collisions at y/s = 1.96 TeV using data correspond- 
ing to an integrated luminosity of 1.12 fb . The cross section result, 8.5 ± 0.6(stat.) ± 
0.7(syst.) pb, is consistent with the SM expectation of 6. 7^7 pb for a mass of 175 GeV/c 2 . 
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The dependence of the acceptance on the top mass results in a variation of the measured 
cross section by ±0.1 pb for every =pl GeV/c 2 shift in the assumed top-quark mass. With 
the innovative technique presented in this Letter the 6-tagging efficiency for high-.Sr 6-jets 
was directly measured in the it sample, and its uncertainty reduced with respect to previous 
results [5|. Future measurements at the Large Hadron Collider, like searches for the Higgs 
boson and supersymmetric particles, where 6-tagging performance is critical, could benefit 
from this technique. 
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TABLE II: Summary of event yields and background (Bkgd) expectations sorted by the number of 
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of 8.5 pb. For events with more than 2 jets the Ht > 200 GeV requirement is applied. 
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FIG. 1: Dependence of the exclusive single and multiple tagged cross sections on the 6-tagging 
efficiency scale factor. The point where the two measurements intersect is in good agreement 
with the independent determination of the 6-tag scale factor from low-pT data. The associated 
confidence bands are also shown. 
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FIG. 2: Summary of background and signal event yields versus number of jets in the event when 
requiring (a) one 6-tagged jet and (b) at least two 6-tagged jets. The it contribution is normalized 
to the measured cross section. The Ht requirement is released for events with fewer than 3 jets. 
The hashed region shows the total systematic uncertainty on the expectation. 
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